Analytical Confidence Intervals for the Number of Different Objects in Data Streams
نویسندگان
چکیده
This paper develops a new mathematical-statistical approach to analyze class of Flajolet-Martin algorithms (FMa), and provides analytical confidence intervals for the number F0 distinct elements in stream, based on Chernoff bounds. The FMa has reached significant popularity bigdata stream learning, attention literature mainly been algorithmic aspects, basically complexity optimality, while statistical analysis these often faced heuristically. provided here shows deep connections with mathematical special functions extreme value theory. latter connection may help explaining heuristic considerations, first opens many numerical issues, at end present paper. Finally, are tested an anonymized real data MonteCarlo simulations support our choice this context.
منابع مشابه
Comparison of five introduced confidence intervals for the binomial proportion
So far many confidence intervals were introduced for the binomial proportion. In this paper, our purpose is comparing five well known based on their exact confidence coefficient and average coverage probability.
متن کاملChemistry with confidence: should Clinical Chemistry require confidence intervals for analytical and other data?
Confidence intervals are not commonly provided with analytical or other data reported in Clinical Chemistry although P values are. However, confidence intervals provide an explicit demonstration of the direction and magnitude of uncertainty and are intuitively easy to grasp, unlike P values. It is therefore argued that the Journal should adopt a policy requiring the provision of confidence inte...
متن کاملstudy of cohesive devices in the textbook of english for the students of apsychology by rastegarpour
this study investigates the cohesive devices used in the textbook of english for the students of psychology. the research questions and hypotheses in the present study are based on what frequency and distribution of grammatical and lexical cohesive devices are. then, to answer the questions all grammatical and lexical cohesive devices in reading comprehension passages from 6 units of 21units th...
metrics for the detection of changed buildings in 3d old vector maps using als data (case study: isfahan city)
هدف از این تحقیق، ارزیابی و بهبود متریک های موجود جهت تایید صحت نقشه های قدیمی سه بعدی برداری با استفاده از ابر نقطه حاصل از لیزر اسکن جدید شهر اصفهان می باشد . بنابراین ابر نقطه حاصل از لیزر اسکنر با چگالی حدودا سه نقطه در هر متر مربع جهت شناسایی عوارض تغییر کرده در نقشه های قدیمی سه بعدی استفاده شده است. تمرکز ما در این تحقیق بر روی ساختمان به عنوان یکی از اصلی ترین عارضه های شهری می باشد. من...
Calculating unreported confidence intervals for paired data
BACKGROUND Confidence intervals (or associated standard errors) facilitate assessment of the practical importance of the findings of a health study, and their incorporation into a meta-analysis. For paired design studies, these items are often not reported. Since the descriptive statistics for such studies are usually presented in the same way as for unpaired designs, direct computation of the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Big Data Research
سال: 2021
ISSN: ['2214-580X', '2214-5796']
DOI: https://doi.org/10.1016/j.bdr.2021.100248